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HEPATITIS C SURROGATE VIRUS FOR TESTING THE ACTIVITY 
OF HEPATITIS C VIRUS PROTEASE, A RECOMBINANT GENE 

AND A USE THEREOF 

BACKGROUND OF THE INVENTION 
Field of the Invention 

The present invention relates to a hepatitis C virus (HCV), more 
particularly, is related to a hepatitis C surrogate virus which comprises poliovirus 
genome, HCV protease and its target site, and is suitable for testing efficacy of 
anti-HCV drugs and screening the same, a recombinant gene and a use thereof. 

Description of the Related Arts 

There are known some types of viruses which cause viral hepatitis. 
Hepatitis A virus is an RNA virus having a diameter of 27 ran and causing 
epidemic hepatitis through oral infection. Hepatitis B virus is a DNA virus having 
a diameter of 42 nm and causing hepatitis through blood infection. A hepatitis 
virus which does not belong to any of the above types was called non-A, non-B 
hepatitis virus. 

Hepatitis C virus (HCV) is the major etiologic agent of non-A, non-B 
hepatitis (Alter et a/., N. Engl. J. Med. 321, 1494-1500, 1989; Choo et a/., Science, 
244, 359-362, 1989; Kuo et a/., Science 244, 362-364, 1989). This virus has 
been implicated in liver cirrhosis and hepatocellular carcinoma (Bruix et a/., Lancet 
ii, 1004-1006, 1989; Saito et a/., Proc. Natl. Acad Sci. USA, 87, 6547-6549, 1990). 
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At present, a-interferon is widely used for treating HCV patients. However, only 
about half of the patients respond to a-interferon, and about half of the 
responders suffer from a recurrence of the virus (Hino et a/., C. J. Med. Virol. , 42, 
299-305, 1994; Tsubota et a/., Hepatology, 19, 1088-1094, 1994). Development 
of other anti-HCV drugs is, therefore, necessary. 

In order to develop the anti-HCV drugs, it is essential to set up HCV 
cultivation system to test the efficacy of the drugs. However, in vitro cultivation 
systems available to date are inefficient and technically difficult to manipulate (Yoo 
ef a/., J. Virol. y 69, 32-38, 1995; Beach ef a/ M Viral Hepatitis Research Foundation 
of Japan, 286, 67, 1993; Shimizu ef a/., Proc. Natl. Acad. ScL USA, 89, 5477-5481, 
1992). Furthermore, HCV titer is very low even in sera of hepatitis patients due 
to its nature of low yield. Accordingly, it is very difficult to screen or test for 
efficacy of antiviral drugs using HCV cultivation. Therefore, the surrogate virus 
will be very useful for studying HCV genes in virus replication and. testing antiviral 
efficacy of drugs affecting HCV-encoded enzymes. Namely, the development of 
the surrogate virus for anti-HCV drugs is urgent. 

SUMMARY OF THE INVENTION 

Accordingly, the present invention is intended to overcome the above- 
mentioned disadvantage in the development of anti-HCV drugs and provide 
proper surrogate viruses for HCV suitable for the investigation of antiviral efficacy 
of drugs and proliferate in the HCV cultivation system. 

An embodiment of the present invention is a recombinant gene coding 
hepatitis C surrogate virus comprising a virus gene coding picornavirus, a 
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protease gene coding hepatitis C virus protease NS3 which is in an open reading 
frame (ORF) of said picornavirus, and a target gene coding target sites of said 
hepatitis C virus protease NS3 which is in said open reading frame of said 
picornavirus. 

Among the picornaviruses, the poliovirus is preferred. 

The target site is preferred to be selected from the group consisting of 
NS5A/5B, NS4A/4B and NS4B/5A. 

The hepatitis C surrogate virus protease NS3 is preferred to cleave the 
peptide bond between cysteine of P1 and glycine of P1 ' as a target site. 

In another aspect, the present invention provides a screening method of 
anti hepatitis C virus drugs comprising the step of detecting a material which 
inhibits surrogate virus proliferation expressed by the recombinant gene for coding 
hepatitis C surrogate virus. 

In another aspect, the present invention provides a measuring method of 
anti-hepatitis C virus drugs activity comprising the steps of proliferating the 
surrogate virus expressed by the recombinant gene coding hepatitis C surrogate 
virus in the presence of anti-hepatitis C virus agents, and measuring the 
proliferating ability of the hepatitis C surrogate virus. 

The present invention, in another aspect, provides a hepatitis C surrogate 
virus comprising the recombinant gene for coding hepatitis C surrogate virus. 

The poliovirus's versatility is used in developing surrogate virus suitable 
for investigation of HCV protease activity. 



WO 98/00548 PCI7KR97/00120 

4 

HCV belongs to the family Flaviviridae whose members are enveloped 

* 

viruses having a positive-sense RNA genome (Francki et a/., Arch. Virol 2(Sullo.), 
223, 1991; Inchauspe etal.,Proc. Natl. Acad Sc/., 88, 10292-10296, 1991; Miller 
et a/., Proc. Natl. Acad. ScL t 87, 2957-2061, 1990; Takamizawa et a/., J. Virol. 65, 
1105-1113, 1991). The RNA encodes a polyprotein (-3010 amino acids) with the 
following gene order: 5-CE1-E2-NS2-NS3-NS4A-NS4B-NS5A-NS5B-3'. During 
and/or after synthesis, the polyprotein is processed into functional proteins by 
host- and virus-encoded proteases. Core protein (C) and envelope proteins (E1 
and E2) are believed to compose the structural elements of the virion particle. 
The balance of the processed proteins is believed to function in replication 
processes of the virus inside the host cells. The signal peptidase on the 
endoplasmic reticulum is responsible for the generation of the N-terminus of E1 , 
E2, and possibly NS2. It has been suggested that the NS2/NS3 junction is 
cleaved by an HCV-en coded metalloprotease, which resides between the C- 
terminal portion of NS2 and the N-terminal region of NS3. The cleavage of the 
NS3/4A, NS4A/4B, NS4B/5A, and NS5A/5B sites depends on the activity of the 
serine protease NS3 (Bortenschlager etai, J. Virol. 68, 5045-5055, 1993; Eckart 
et a/., Biochem, Biophys. Res. Commun. 192, 399-406, 1993; Grakoui etai, J. 
Virol. 67, 2832-2843, 1993; Han etai, J. Gen. Virol. 76, 985-993, 1995; Hijikata ef 
a/ M J. Virol. 67, 4665-4675, 1993; Tomei et a/., J. Virol. 67, 4017-4026, 1993). 
HCV protease NS3 is a good putative therapeutic target molecule for the 
development of anti-HCV drugs, since its activity is considered essential for viral 
proliferation. Protease activity of NS3 resides in the N-terminal part of NS3 
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(Bartenschlager et a/., J. Virol. 67, 3835-3844, 1994; Han et a/., J. Gen Virol. 76, 
985-993, 1995; Lin etal. t J. Virol. 68, 8147-8157, 1994), and NS4A enhances the 
protease activity of NS3 (Bartenschlager et a/ M J. Virol. 68, 3835-3844, 1994; 
Failla et a/., J. V7ro/. 68, 3753-3760, 1994; Hahm et a/., J. Wro/. 69, 2534-2539, 
5 1995; Lin et a/., J. Virol. 69, 4373-4380, 1995; Tanji et a/., J. l//ro/. 69, 1575-1581 , 
1995). Characterization of the NS3 protease using known protease inhibitors has 
revealed that NS3 is a chymotrypsin-like protease (Hahm et a/., J. Virol. 69, 2534- 
2539, 1995). 

The poliovims is the prototype of picornaviruses. The polioviral genome 
io is composed of a positive-sense RNA molecule that encodes a single open 
reading frame (ORF) (Kitamura et a/., Nature (London) 291, 547-553, 1981; 
Racaniello et a/. ( Proc. Natl. Acd. Sci. USA 78, 4887-4891, 1981). Upon infection, 
the genomic RNA is translated into a large precursor polyprotein via the internal 
ribosomal entry site (IRES) in the 5-nontrasIated region of the RNA (Jang et a/., J. 
15 Virol. 62, 2636-2643, 1988; Pelletier et a/., Nature (London) 334, 320-325, 1988; 
Pelletiet et a/., J. Virol 63, 441-444, 1989). The polyprotein, in turn, is processed 
into the mature viral structural and nonstructural proteins by the 2A, 3C, and 3CD 
proteases (Harris et a/., Semin. Virol. 1, 323-333, 1991; Lawson and Semler., In 
"Current Topics in Microbiology and Immunology" (V.R. Racaniello, Ed.), 161, 49- 
20 87, 1990). Poliovirus is one of the best studied viruses and is relatively easy to 
manipulate and cultivate. Production of infectious poliovirus from cDNA clones 
has been practiced for a decade to study functions of poliovirus proteins (Molla et 
a/., Science 254, 1647-1651, 1991; Van der Werf et a/., Proc. Natl. Acad. Set. 
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USA 83, 2330-2334, 1986) or for use of the poliovirus as a vector in delivering 
foreign genes into host cells (Alexander et a/. ( Proc. Natl. Acad Sci. USA 91, 
1406-1410, 1994; Andino et a/., Science 265, 1448-1451, 1994; Lu et a/.. J. Virol. 
69, 4797-4806, 1995) 

Inventors generated a hybrid poliovirus which requires the activity of HCV 
protease NS3 for its proliferation, on the basis of the characteristics of HCV and 
poliovirus. The hybrid virus is composed of poliovirus and HCV. The proteins 
necessary for the proliferation and infection into other cells are produced by 
poliovirus gene for the hybrid virus. The poliovirus gene is preceded by HCV 
protease and the sites cleaved by the protease to construct this hybrid virus which 
requires HCV protease NS3 activity for viability. This virus is composed, 
sequentially, HCV NS3 protease domain plus its target site, and open reading 
frame of poliovirus which is necessary for virus replication and virus particle 
formation. The activity of the enzyme NS3 protease is necessary to survive the 
hybrid virus. 

The hybrid virus is proliferate in the cultivation system to settle the 
conventional problem which HCV cannot be proliferated in vitro cultivation 
systems. Therefore, the hybrid virus enables to develop anti-HCV drugs based 
on protease inhibitor easily and precisely. 

DETAILED DESCRIPTION OF THE DRAWINGS 

The present invention will be described with reference to the 
accompanying drawings, in which: 
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FIG. 1 is schematic diagram of the genomic organization of poliovirus type 
1 (Mahoney), NS3 A C-PV1, and NS3 a C-PV1 . 

FIG. 2 is a photograph of plaque phenotypes of PV1 (M) and vNS3 A C- 

PV1. 

FIG. 3 is a photograph of RT-PCT analysis of vNS3 A C-PV1 RNA. 

Lane 1 , 2 and 3 depict DNAs produced by PCR or RT-PCR using as 
template of pNS3 A C-PV1 , (+) sense RNA and (-) sense RNA, respectively. 

FIGs. 4A, 4B and 4C are photographs of proteolytic processing patterns of 
PV1, NS3 A C-PV1, and NS3 A C'-PV1 polyproteins. 

FIG. 5 is one-step graph curves of PV1 (M) and vNS3 a C-PV1 . HeLa 
cells were infected with either PV1 or vNS3 A C-PV1 at a multiplicity of infection 
of 10. 

FIG. 6A to FIG. 6E are shown the genetic sequence of pNS3 A C-PV1 
according to an embodiment of the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

In the following detailed description, only the preferred embodiment of the 
invention has been shown and described, simply by way of illustration of the best 
mode contemplated by the inventor(s) of carrying out the invention, As will be 
realized, the invention is capable of modification in various obvious respects, all 
without departing from the invention, Accordingly, the drawing and description 
are to be regarded as illustrative in nature, and not as restrictive. 
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EXAMPLE 

Material and Methods 
Construction of plasmids 

DNA manipulations were carried out by standard methods (Sambrook et 
a/., Molecular cloning: a laboratory manual (2nd ed.), Cold Spring Harbor 
Laboratory, Cold Spring Harbor, N.Y., 1989). The construction of the plasmids 
pTHE A 1219-1634 and pPNENPO are described elsewhere (Han et a/. f J. Gen. 
Virol., 76, 985-993, 1995; Alexander etal. t Proc. Natl. Acad ScL, USA, 91, 1406- 
1410, 1994). pUC19 was cleaved with Xbal, treated with Klenow, and digested 
with Pstl. The resulting 2.6 kb fragment was then ligated to the 1.3 kb fragment 
of pTHEAl219-1634 treated with Ncol, Klenow, and Pstl in succession. The 
result was the construct pUC19-NS3 A C-4 A C. pPNENPO and pUC19-NS3 A 
C-4 a C were digested with EcoRI or Kpnl, respectively, and treated with Klenow 
enzyme. The linearized DNAs were ligated to result in the construct pPNENPO- 
NS3 A C-4 A C. 

To construct pNS3 A C-4A-PV1 , a polymerase chain reaction (PCR) was 
performed using the following four oligonucleotides: 
Primer 1: 5-TGATATCGAATTCCGG-3'; 
Primer 2: 5 , -ATATGAGCTCCGCACTCTTCCATCTC-3 , ; 
Primer 3: 5 -ATATGAGCTCAGGTTTCATCACAGAAAG-S 1 ; 
Primer 4: 5-CTGTGCTAGCGCTTTTTG-3\ 

The primers 1 plus 2 and pPNENPO-NS3 A C-4 A C were used to 
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amplify the HCV protease NS3 A C-4A region (PCR product 1). The PGR 

product 1 contains a new Sacl site right after the NS4A-coding sequence. The 
primers 3 plus 4 and pPNENPO were used to amplify polioviral cDNA (746-2479) 
with a new Sacl site at the beginning of the poliovirus ORF (PCR product 2). 
EcoRI and Nhel digested pPNENPO, EcoRI and Sacl digested PCR product 1, 
and Sacl and Nhel digested PCR product 2 were ligated together to generate 
plasmid pNS3 A C-4A-PV1. 

To construct pNS3 A C-PV1 , an oligonucleotide (5'- 

AGCTCCGCAGCAGACGACGTCCTCACTGGCTTCCTCTTGCGCGCCTCCTCCT 
CCGGTTGAGTTATCTGTGAAGAC-3 , ) encoding the C-terminal end of NS5A (P1- 
P10) and primer 1 were used to amplify the HCV region encoding amino acids 
1016-1215. The PCR product was phosphorylated with polynucleotide kinase 
and then digested with EcoRI. pNS3 a C-4A-PV1 was cleaved with Sacl, 
treated with T4 DNA polymerase, and then digested with EcoRI. The larger 
fragment of pNS3 A C-4A-PV1 was ligated to the PCR product to generate pNS3 
a C-PV1. The junction sequences were confirmed by sequencing. 

Site-directed mutagenesis of the serine in the catalytic triad of the HCV 
protease (Ser1165Ala). 

Site-directed mutagenesis was carried out by the Kunket method as 
previously described (Kunkel et a/., Methods EnzymoL, 154, 367, 1987). 
pSK1016-1650 was generated by ligation of the DNA fragments isolated from 
Pvull plus Sail-digested pTHE1016-1846 and Smal plus Sail-digested pBluescript 
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SK(+). E. coli strain RZ 1032 (ungduf) was transformed with pSK1016-1650, and 
ssDNA was prepared from the transformant. An oligonucleotide, 5'- 
CTTGAAGGGCTCCGCG GGTGG-3', designed to change the serine residue in 
the catalytic triad of HCV protease to alanine [Ser 1165(TCG) to Ala(GCG)] was 
used for site directed mutagenesis. The oligonucleotide was phosphorylated with 
T4 polynucleotide kinase (Boehringer Mannheim) and annealed to the single 
stranded pSK1016-1650 DNA, extended with sequenase (USB), and ligated with 
T4 DNA ligase (Poscochem.) to result in covalently closed, circular, dsDNA. The 
ligated dsDNA was introduced into E. coli XL 1 -blue. The base change was 
confirmed by the presence of a newly generated Sacll site and by sequencing. 
pSK1016-1650 (Ser1165Ala) was used to construct pNS3 A C-PV1 by replacing 
the Xmal fragment of pNS3 A C-PV1 with the same fragment of pSK1016-1650 
(Ser1165Ala). 

In vitro transcription and translation 

Plasmid DNAs were purified following the polyethylene glycol precipitation 
method. They were then linearized with appropriate restriction enzymes 
downstream of the translation termination cod on. The linearized DNAs were 
extracted with phenol/chloroform and ethanol-precipitated. RNAs were 
transcribed from the purified DNAs with T7 RNA polymerase (Boehringer 
Mannheim) as described by the manufacturer. These RNA transcripts were 
translated in a rabbit reticulocyte lysate system (RRL) supplemented with 25% 
HeLa cell lysate in the presence of [ 35 S] methionine. The in vitro translation 
reactions were carried out at 30 T: for 8 hours. The [ 35 S]-labeled proteins were 
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analyzed on 15% SDS-PAGE using the buffer system described by Nicklin et al. 
(1987). The intensity of the autoradiographic images was enhanced by 
fluorography using salicylic acid. Gels were dried and exposed to Kodak XAR-5 
film or Agfa Curix RP1 film. 

5 RNA transfection 

An electroporation protocol was used to transfect HeLa cell monolayers 
with the viral RNAs prepared in vitro. Cells grown to 80% confluency were 
harvested. After two washes, the cells were resuspended in the medium at a 
concentration of 5 x 10 6 cells/ml The cell suspension (1 ml) was mixed with 3 

10 vg of RNA and placed in a 0.4 cm Gene Pulser cuvette (BIO-RAD). The cuvette 
was electrically pulsed at 360 volt and 960 //F using the Gene Pulser transfection 
apparatus (BIO-RAD). Then the cells were diluted with 9 urf of medium 
containing 10 % bovine calf serum, 5 % equine serum, and 5 % fetal calf serum. 
Plaque assay and one-step growth curve 

15 Titers of virus stocks were measured by plaque assay on HeLa cell 

monolayers as follows. Cells were inoculated with virus, left for 30 minutes at 
room temperature, and then overlaid with Eagle medium containing 0.9 % Noble 
agar and 5 % fetal calf serum. Viral plaques were visualized with 0.5 % crystal 
violet solution 60 hours postinfection. 

20 To measure one-step growth kinetics, virus was loaded onto 3.5 cm HeLa 

cell plates at a multiplicity of infection of 10 per cell and incubated at room 
temperature for 30 minutes to allow virus binding. Unbound particles were 
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removed by two rounds of PBS washing. The infected cells were cultured at 
37 1C in 2 inC of Eagle medium containing 10 % bovine calf serum and harvested 
at the time points indicated in FIG. 5. The infectivity of the cell lysate at each 
time point was titrated by plaque assay. 

Reverse transcription PCR (RT-PCR) 

Total RNA from virus-infected cells was isolated 4 hours postinfection as 
described by Lu et al. (1995). cDNAs were synthesized using M-MuLV reverse 
transcriptase and oligonucleotide primers corresponding to poliovirus nucleotides 
556-570 and 789-806 (primer 5: S'-GTGTTTCCTTTTATT-S*; primer 6: 5'- 
GTGTTTCCi 1 1 iATT-3'). Primer 5 and primer 6 were used in synthesizing 
cDNAs from negative- and positive-sense RNA, respectively. Both of the primers 
were used in the PCR reaction that followed. The PCR products were analyzed 
on a 1.3 % agarose gel. 

Labeling of newly synthesized proteins in virus-infected cel ls 

HeLa cells were infected with poliovirus 1 (Mahoney: PV1) and vNS3 a 
C-PV1 at a multiplicity of infection of 30 PFU per cell. The cells were incubated 
in a medium lacking methionine for 10 minutes 3 hours after infection. 
Subsequently, 100 u Ci of [ 35 S] methionine was added to the medium and the 
incubation continued for 40 minutes. The cells were then collected and lysed. 
The lysate was analyzed on 15 % SDS-PAGE. 

Immunoblot analysis 

Virus-infected cells were lysed 4 hours postinfection. Total protein (13 
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tig) was resolved on a 15 % SDS-PAGE gel and transferred onto a nitrocellulose 
membrane (Amersham). The blot was incubated overnight at 4 V, in blocking 
solution [20 mM Tris-HCI (pH 7.4), 150 mM NaCI, 0.5 % Tween 20, and 5 % 
powdered skim milk] to block nonspecific binding. The primary antibody 

5 (polyclonal antibody against HCV protease NS3) was added to the blocking 
solution for 3 hours. The antibody was a generous gift from Dr. R. 
Bartenschlager, Institute of Virology, Mainz, Germany. A horseradish 
peroxidase-linked anti-rabbit IgG in a 1:10,000 dilution was used as the secondary 
antibody. Membrane-bound antibodies were detected by enhanced 

io chemiluminescence reagents (ECL; Amersham). 

Generation of a novel poliovirus containing HCV protease NS3 and its 
target site 

In order to develop an evaluation system for anti-HCV drugs targeting 
protease NS3, a recombinant poliovirus that requires the activity of HCV protease 

15 for its proliferation, was constructed. A schematic diagram of the genomic 
structure of this virus is depicted in FIG. 1. The hatched and open boxes represent 
HCV protease NS3 a C and poliovirus poly protein, respectively. The amino acid 
sequence at the junction of HCV protease NS3 and poliovirus P1 is indicated by 
single letter amino acid codes. The target site of HCV protease is marked by an 

20 arrowhead. The mutated amino acid (S1165A) in NS3 a C'-PV1 is indicated by 
an asterisk. 

As shown in FIG. 1, in the case of polyprotein NS3 a C-PV1, the protease 
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domain of HCV NS3 is fused to the last 10 amino acids of NS5A (EEASEDWCC), 
which are recognized by protease NS3. These amino acids residues are 
preceded by four glycines, alanine, and giutamine serving as a hinge region 
between the protease domain and the target site. It was expected that the HCV 
protease NS3 would cleave the peptide bond between cysteine at the C-terminus 
of the additional polypeptide and glycine at the N-terminus of poliovirus P1, since 
HCV protease NS3 is known to cleave between the cysteine or threonine at the 
P1 -residue and the small uncharged residues at the P1 -residue (Kolykhalov et a/., 
J. Virol. 68, 7525-7533, 1994; Komoda etal. t J. Virol. . 68, 7351-7357, 1994). 

Upon transfection, the RNA transcript encoding NS3 A C-PV1 produced a 
virus designated vNS3 A C-PV1. On the other hand, the RNA transcript similar 
to NS3 A C-PV1 but containing a serine to alanine mutation at the catalytic triad 
of HCV protease NS3 (NS3 A C'-PV1) did not yield virus. This strongly suggests 
that HCV protease NS3 activity is essential for the viability of the hybrid virus 
vNS3 A C-PV1. The virus vNS3 A C-PV1 has a small plaque phenotype at 
37 1C as shown in FIG. 2. (compare plaque sizes in panel PV1 with vNS3 A C- 
PV1)orat30 X, (data not shown). 

The virus vNS3 A C-PV1 is stable through at least five passages. The 
integrity of vNS3 A C-PV1 was confirmed by analyzing the viral RNAs in cells 
inoculated with a virus stock of the fourth passage. Lanes 1 , 2 and 3 depict 
DNAs produced by PGR or RT-PCR using as templates plasmid pNS3 A C-PV1, 
(+) sense RNA, respectively. The inserted HCV sequence in the poliovirus 
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genome was amplified by RT-PCT with primers annealing to the poliovirus 5* 
nontranslated region and the P1 -coding region. The viral RNAs of the fifth 
passage retained the inserted HCV protease and NS5A sequences as shown in 
FIG. 3. Both positive and negative sense viral RNAs of the expected sizes were 

5 detected (FIG. 3, lanes 2 and 3, respectively). RT-PCR analysis of the genomic 
RNAs isolated from vNS3 A C-PV1 virion particles revealed the same RT-PCR 
products as the RNAs isolated from HeLa cells (data not shown). The junction 
sequence between HCV and poliovirus was confirmed by DNA-sequencing of the 
PCR products and the sequences of hepatitis C surrogate virus genome is 

10 presented in FIG. 6. This indicates that vNS3 A C-PV1 is stable at least for five 



Proper proteolytic processing of vNS3 A C-PV1 polyprotein requires HCV 

NS3 protease activity. 

Proteolytic processing patterns of polyproteins encoded by poliovirus, 

15 vNS3 A C-PV1, and NS3 A C-PV1 RNA were analyzed by in vitro transcription 
and translation of pT7-PC1, pNS3 A C-PV1, and pNS3 A C'-PV1. Most of the 
polioviral proteins were detected as in vitro translation products of the PV1 RNA 
transcript (FIG. 4A, lane 1). Two additional protein bands of 123 kDa and 26 kDa 
were apparent among the translation products of NS3 A C-PV1 as compared to 

20 the PV1-translation product (FIG. 4A. lane 2). The 123 kDa and 26 kDa proteins 
are most likely NS3 a C-P1 fusion protein and the processed NS3 A C. 
respectively, since the cleavage of poliovirus at the P1/P2 junction occurs very 



passages. 
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fast, even during the process of translation. Translation of vNS3 A C*-PV1 also 
yielded the 123 kDa protein. But the 26 kDa and the P1 protein were not 
detected among the translation products from this construct (FIG. 4A, lane 3). 
This indicates that proteolytic cleavage of the HCV/poliovirus junction does not 
occur in NS3 A C*-PV1 , and that the cleavage of this junction is mediated by the 
protease activity of NS3. It also shows that the failure of virus production with the 
NS3 A C # -PV1 construct is most likely due to the lack of the P1 polypeptide which 
is a protein processed into viral coat protein (VPO, VP1, and VP3). In other 
words, HCV NS3 activity is essential for proliferation of the vNS3AC-PV1. 
Interestingly, complete processing of the viral capsid proteins (VPO, VP1, and 
VP3) from the NS3 A C"-PV1 polyprotein was not detected. On the other hand, 
the processed capsid proteins were apparent along with the precursor NS3 A C- 
P1 (compare lane 2 with lane 3 in FIG. 4A). The blockage of proteolytic 
processing of the NS3 A C'-PV1 polyprotein may be due to the lack of 
myristoylation of the glycine residue at the N-terminus of P1 that has been shown 
to be important in the processing of P1 by poliovirus protease 3C-3D (Krausslich 
et a/., J. ViroL, 64, 2433-2436, 1990). 

The proteolytic processing was also investigated for PV1 and vNS3 A C- 
PV1 by labeling newly synthesized proteins with [ 35 S] methionine in virus-infected 
cells. The labeling in vNS3 a C-PV1 -infected cells showed four additional bands 
with apparent molecular weights of 123 kDa, 89 kDa, 63 kDa and 26 kDa as 
indicated by arrow heads (FIG. 4B, compare lane 2 with lane 3). These proteins 
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are most likely NS3 A C-P1, NS3 A C-VP0-VP3, NS3 A C-VPO, and NS3 A C ( 
respectively (FIG. 4B). This assumption is supported by the Western blotting 
analysis using an antibody against HCV NS3 and HeLa cell extracts after infection 
with either wild type poliovirus or vNS3 A C-PV1 (FIG. 4C). Among the proteins 
5 of vNS3 A C-PV1 -infected cells, the anti-NS3 antibody recognized the 63 kDa 

and the 26 kDa polypeptides corresponding to vNS3 A C-VPO and NS3 A C, 
respectively (FIG. 4C, lane 3). A longer exposure of the X-ray film also revealed 
the 123 kDa band and the 89 kDa band corresponding to NS3 A C-P1 and NS3 

A C-VP0-VP3, respectively (data not shown). On the other hand, no bands were 
10 detected in the Western analysis of cell extracts from wild type poliovirus- or 
mock-infected cells (FIG. 4C, lanes 1 and 2). The results indicate that proteolytic 
processing of the artificial polyprotein occurs as expected, even though the 
HCV/poliovirus junction cleavage happens slower than the P1/P2 junction 
cleavage. The relative intensities of the NS3-related bands varfed depending on 
is the methodologies used to detect the polypeptides. For instance, a strong band 
of NS3 A C-P1 was detected by in vitro and in vivo labeling, but only a weak band 
of IMS3 A C-P1 was detected by Western blot analysis. The p 5 S] methionine 
labeling method reveals only newly synthesized polypeptides and larger proteins, 
containing more methionine residues, give stronger signals. Western blot 
20 analysis, on the other hand, reveals all of the proteins interacting with the primary 
antibody, and the intensity of the signal reflects the molarities of the proteins 
rather than their size. Therefore, [^methionine labeling methods are expected 
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to give relatively stronger signals for precursor forms than Western blot analysis 
methods. 

One-step growth kinetics of chimeric virus vNS3 A C-PV1 

FIG. 5 was one step growth curve for vNS3 A C-PV1. While maximum 
5 production of wild type poliovirus was reached 4 hours postinfection (FIG. 5, open 
squares), that of vNS3 A C-PV1 was delayed by about 2 hours (FIG 5, closed 

circles). In FIG. 5, the titer of vNS3 A C-PV1 at the peak was about 1\5 orders 
of magnitude lower than that of wild type poliovirus, as viral titers at 9 hours 
postinfection was compared. The small plaque phenotype of vNS3 A C-PV1 
10 may thus be a result of delayed virion particle formation and smaller burst size. 

The HCV protease NS3 activity is necessary for noble poliovirus according 
to the present invention to survive. This virus is useful for testing the efficacy of 
anti-virus drugs which targets protease NS3. 
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WHAT IS CLAIMED IS: 

1 . A recombinant gene coding hepatitis C surrogate virus 
comprising: 

a virus gene coding picomavirus; 
5 a protease gene coding hepatitis C virus protease NS3 which is in an 

open reading frame (ORF) of said picomavirus; and 

a target gene coding target sites of said hepatitis C virus protease NS3 
which is in said open reading frame of said picomavirus. 

2. The recombinant gene coding hepatitis C surrogate virus of claim 
10 1, wherein said picomavirus is poiiovirus. 

3. The recombinant gene coding hepatitis C surrogate virus of claim 
1, wherein said target site is selected from the group consisting of NS5A/5B, 
NS4A/4B and NS4B/5A. 

4. The recombinant gene coding hepatitis C surrogate virus of claim 
is 1 , wherein said hepatitis C surrogate virus protease NS3 cleaves the peptide bond 

between cysteine of P1 and glycine of P1 ' as a target site. 

5. A screening method of anti hepatitis C virus drugs comprising the 

step of: 

detecting a material which inhibits surrogate virus proliferation expressed 
20 by said recombinant gene for coding hepatitis C surrogate virus of claim 1 . 

6. A measuring method of anti-hepatitis C virus drugs activity 
comprising the steps of: 
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proliferating the surrogate virus expressed by said recombinant gene 
coding hepatitis C surrogate virus of claim 1 in the presence of anti-hepatitis C 
virus drugs; and 

measuring the proliferating ability of said hepatitis C surrogate virus. 

7. A hepatitis C surrogate virus comprising: 

a recombinant gene for coding hepatitis C surrogate virus of claim 1 . 
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FIG. 3 
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FIG. 4A 



MW 

(kDa) 
200- 

98- 

68 — I 



43 



29 



a. 

o 

< 

CO 



a 
i 

* 

o 

< 
m 

to 
Z 



NS3AC-P1 



P1 

3CD 



2BC 

VPO 
2C.VP1 



VP3 



NS3AC 




1 2 3 



WO 98/00548 



1>CX/J\E97/00120 



5/12 



FIG. 4B 
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FIG. 4C 
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FIG. 6A 

TTAAAACAGCTCTGGGGTTGTACCCACCCCAGAGGCCCACGTGGCGGCTAGTACTCCGGT 
ATTGCGGTACCCTTGTACGC 

CTGTTTTATACTCCCTTCCCGTAACTTAGACGCACAAAACCAAGTTCAATAGAAGGGGG 
TACAAACCAGTAGCACCACGA 

ACAAGCACTTCTGTTTCCCCGGTGATGTCGTATAGACTGCTTGCGTGGTTGAAAGCGACG 
GATCCGTTATCCGCTTATGT 

ACTTCGAGAAGCCCAGTACCACCTCGGAATC7TCGATGCGTTGCGCTCAGCACTCAACCC 
CAGAGTGTAGCTTAGGCTGA 

TGAGTCTGGACATCCCTCACCGGTGACGGTGGTCCAGGCTGCGTTOGCGGCCTACCTATG 
GCTAACGCCATGGGACGCTA 

GTTGTGAACAAGGTGTGAAGAGCCTATTGAGCTACATAAQAATCCTCCGGCCCCTGAAT 
GCGGCTAATCCCAACCTCGGA 

GCAGGTGGTCACAAACCAGTGATTGGCCTGTCGTAACGCGCAAGTCCGTGGCGGAACCGA 
CTACTTTGGGTGTCCGTGTT 

TCCTTTTAT7TTATTGTGGCTGCTTATGGTGACAATCACAGATTGTTATCATAAAGCGA 
AGACGGTATAGCTTGATATCG 

AATTCCGGGGATCCTCTAGCATGGCCAGTCATCATCATCATCATCATGGAATTGTCGAC 
AATTCCCCGGCCGATAGCCTT 

GAAGGGGAGGGGTGGCAACTCCCCGCTCCCATCACGGCCTACTCCCAACAGACGCGGGGCC 

TACTTGGTTGCATCATCAC 

TAGCCTCACAGGCCGGGACAAGAACCAAGTCaAGGGGGAGGTTCAAGTGGTTTCCACCGC 

AACACAATCTTTCCTGGCGA 

CCTGCGTCAATGGCGCTTGGACTGTCTTCCATGGTGGCGGCTGAAAGACCCTAGCCGGCCC 
AAAGGGGCCAATTACCCAA 

ATGTACACCAATGTAOACCTGGACCTCGTCGGCTGGCAGGCACCCCCCGGGTCGCGTCCCC 
TGACACCATGGACCTGCGG 

CAGCTCAGACCTTTACTTGGTCACGAGACATGCTGATGTCATTCCGGTGCGCCGGCGGGG 
CGACAGTAGGGGGAGCCTAC 

CCTCTCCCAGACCAGTCTCCTACTTGAAGGGCTCCTCGGGTGGTCCACTGCTCTaCCCTTC 
GGGGGACGCTGTTGGCATC 

TTTCGGGCTGCTGTATGCACCCGGGGGOTTGCGAAGGCGGTGGACTTCATACCCGTTGAA 
TCTATGOAAACTACTATGCG 

GTCTCCGGTCTTCACAGATAACTCAACCGGAGGAGGAGGCGCGCAAGAGGAAGCCAGTGA 
GGACGTCGTCTGCTGCGGAG 

CTCAGGTTTCATCACAGAAAGTGGGCGCACATGAAAACTCAAATAGAGCGTATGGTGGT 
TCTACGATTAATTACACCACC 

ATTAATTATTATAGAGATTCAGCTAGTAACGCGGCTTCGAAACAGGACTTCTCTCAAGA 
CCCTTCCAAGTTCACCGAGCC 

CATCAAGGATGTCCTGATAAAAACAGCCCCAATGCTAAACTCGCCAAACATAGAGGCTT 
GCGGGTATAGCGATAGAGTAC 

TGCAATTAACACTGGGAAACTCCACTATAACCACACAGGAGGCGGCTAATTCAGTAGTC 
GCTTATGGGCGTTGGCCTGAA 

TATCTGAGGGACAGCGAAGCCAATCCAGTGGACCAGCCGACAGAACCAGACGTCGCTGCA 
TGCAGGTTTTATACGCTAGA 

CACCGTGTCTTGGACGAAAGAGTCGCGAGGGTGGTGGTGGAAGTTGCCTGATGCACTGAG 
GGACATGGGACTCTTTGGGC 

AAAATATGTACTACCACTACCTAGGTAGGTCCGGGTACACCGTGCATGTACAGTGTAAC 
GCCTCCAAATTCCACCAGGGG 

GCACTAGGGGTATTCGCCGTACCAGAGATGTGTCTGGCCGGGGATAGCAACACCACTACC 
ATGCACACCAGCTATCAAAA 

TGCCAATCCTGGCGAGAAAGGAGGCACTTTCACGGGTACGTTCACTCCTGACAACAACCA 
GACATCACCTGCCCGCAGGT 

TCTGCCCGGTGGATTACCTCCTTGGAAATGGCACGTTGTTGGGGAATGCCTTTGTGTTCC 
CGCACGAGATAATAAACCTA 

CGGACCAACAACTGTGCTACACTGOTACTCCCTTACGTGAACTCCCTCTCGATAGATAGT 
ATGGTAAAGCACAATAATTG 

GGGAATTGCAATATTACCATTGGCCCCATTAAATTTTGCTAOTGAGTCCTCCCCAGAGA 
TTCCAATCACCTTOACCATAG 
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FIG. 6B 



CCCCTATGTGCTGTGAGTTCAATGGATTAXGAAACATCACCCTOCCACOCTTACAGGGCC 
TGCCGGTCATGAACACCCCT 

GGTAGCAATCAATATCTTACTGCAGACAACTTCCAGTCACCGTOTGCGCTGCCTGAATT 
TGATGTGACCCGACCTATTGA 

CATACCCGGTGAAGTAAAGAACATGATGGAATTGGCAGAAATCGACACCATGATTCCCT 
TTGACTTAAGTGCCACAAAAA 

AGAACACCATGaAAATGTATAGGGTTCGGTTAAGTGACAAACCACATACAGACGATCCC 
ATACTCTOCCTOTCACTCTCT 

CCAGCTTCAGATCCTAGGTTGTCACATACTATGCTTGGAGAAATCCTAAATTACTACAC 
ACACTGGGCAGGATCCCTGAA 

GTTCACGTTTCTGTTCTGTGGATCCATGATGGCAACTGGCAAACTGTTGGTGTCATACG 
CGCCTCCTGGAGCCGACCCAC 

CAAAGAAGCGTAAOGAGGCGATGTTGGGAACACATGTGATCTGGGACATAGGACTGCAG 
TCCTCATGTACTATGGTAGTG 

CCATGGATTAGCAACACCACGTATCGGCAAACCATAGATGATAGTTTCACCGAAGGCGG 
ATACATCAGCGTCTTCTACCA 

AACTAGAATAOTCGTCCCTCTTTCGACACCCAGAGAGATGGACATCCTTGGTTTTGTGT 
CAGCGTGXAATGACTTGAGCG 

TGCGCTTGTTGCGAGATACCACACATATAGAOCAAAAAGCGCTAGCACAGGGGTTAGGT 



CAOATGCTTGAAAOCATGATT 

GACAACACAGTCCGTGAAACGGTGGGGGCGGCAACATCTAGAGACGCTCTCCCAAACACT 




TTCTGATACAGTGCAAACGA 

OACATGTTGTACAACATAGGTCAAGGTCAOAGTCTAGCATAGAGTCTTTCTTCGCGCGG 



GGTGCATGCGTGACCATTATG 

ACCGTGOATAACCCAGCTTCCACCACGAATAAGOATAAGCTTTTTGCAGTGTGGAAGAT 
CACTTATAAAGATACTGTrCCA 

GTTACGGAGGAAATTGGAGTTCTTCACCTATTCTAGATTTGATATGGAACTTACCTTTG 
TOGTTACTGCAAATTTCACTG 

AGACTAACAATGGCCATGCATTAAATCAAGTGTACCAAATTATGTACGTACCACCAGGC 
GCTCCAOTGCCCGAAAAATOG 

GACGACTACACATGOCAAACCTCATCAAATCCATCAATCTTTTACACCTACGGGACAGCT 
CGAGCCCGGATCTCGGTACC 

GTATGTTGGTATTTCGAACGCCTATTCACACTTTTACGACGGTTTTTCCAAAGTACCAC 
TGAAGOACCAGTCGGCAGCAC 

TAGGTaACTCCCTTTATGGTGCAGCATCTCTAAATGACTTCGGTATTTTGGCTGTTAGA 
GTAGTGAATOATCACAACCCG 

ACCAAGGTCACCTCCAAAATCAGAGTGTATCTAAAACCCAAACACATCAGAGTCTGGTG 
CCCGCGTCCACCaAGGGCAGT 

GGCOTACTACGGCCCTGGAGTGGATTACAAGGATGGTACGCTTACACCCCTCTCCACCAA 
GGATCTOACCACATATGGAT 

TCGGACAC C AAAAC AAAGC GGTGTAC ACTGC AGGTT ACAAAATTTGC AACT AC C ACTT G 
GCCACTCAGOATGATTTGCAA 

AACGCAGTGAACGTCATGTGOAGTAGAGACCTCTTAGTCACAGAATCAAGAGCCCAGGG 
CACCOATTCAATCGCAAGGTG 

CAATTGCAACGCAGGGGTGTACTACTGCGAGTCTAGAAOGAAATACTACCCAGTATCCT 
TCGTTGGCCCAAC GTTC CAGT 

ACATGGAGGCTAATAACTATTACCCAGCTAGGTACCAGTCCCATATGCTCATTGGCCAT 
GGATTCGCATCTCCAGGGGAT 

TGTGGTGGCATACTCAGATGTCACCACaOGGTGATAGGGATCATTACTGCTGGTGGAGA 
AGGGTTGGTTGCATTTTCAGA 

CATTAGAGACTTGTATGCCTACGAAGAAGAAGCCATGGAACAAGGCCTCACCAATTACA 
TAQAQTCAC T ' l ' Q GQQCCrGCAT 

TTGGAAGTGGATTTACTCAGCAGATTAGCGACAAAATAACAGAGTTGACCAATATGGTG 
ACCAGTACCATCACTGAAAAG 

CTACTTAAGAACTTGATCAAGATCATATCCTCACTAGTTATTATAACTAGGAACTATGA 
AGACACCAGAACAGTGCTCGC 
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FIG. 6C 



XACCCXGGCCCXXCXTGGGXGXGAXGCXXCACCAXGGGAGXGGCXXAGAAAGAAAGCAXG 
CGATGTTCTGGAGATACCTT 

ATGTCATCAAGCAAGGTGACAGTTGGTTGAAGAAGXTTACTGAAGCAXGCAACGCAGCT 
AAGGGCCXGGAGXG&GXGXCA 

AACAAAATCTCAAAATTCATTGATTGGCTCAAGGAGAAAATTATCCCACAAGCTAGAGA 
TAAGTTGOAATTTGTAACAAA 

ACTXAGACAACTAGAAAXGCXGGAAAACCAAAXCXCAACXAXACACCAAXCAXGCCCXA 
qT CA G*?* A C ft A 4?Q AAATTC 

TATTCAATAATGTCAGATGGTTATCCATCCAGTCTAAGAGGTTTGCCCCTCTTTACGCA 
GTGGAAGCCAAAAGAATACAG 

AAACTCGAGCATACTATTAACAACTACATACAGTTCAAGAGCAAACACCGTATTGAACC 
AGTATGTTTGCTAGTACATGG 

CAGCCCCGGAACAGGTAAATCTGTAGCAACCAACCTGATTGCTAGAGCCATAGCTGAAA 
GAOAAAACACOTCCACOTACT 

CGCTACCCCCGGATCCATCACACTTCGACGGATACAAACAACAGGGAGTGGTGATTATGG 
ACGACCTGAATCAAAACCCA 

GATGGTGCGGACATGAAGCTGTTCTGTCAGATGGTATCAACAGTGGAGTTTATACCACC 
CATGGCATCCCTGGAGGAGAA 

AGGAATCCTGTTTACTTCAAATTACGTTCTAGCATCCACAAACTCAAOCAGAATTTCCC 
CCCCCACTGTGGCACACAGTG 

ACGCGTTAOCCAOaCGCTTTGCGTTCGACATGGACATTCAGGTCATGAATGAGTATTCT 
AGAGATGGGAAATXGAACATG 

GCCATGGCTACTGAAATGTGTAAGAACTGTCACCAACCAGCAAACTTTAAGAGATGCTG 
TCCTTTAGTGTGTGGXAAGGC 

AATTCAATTAATGGACAAATCTTCCAGAGTTAGATACAGTATTGACCAGATCACTACAA 
XGATXATCAAXGAGAGAAACA 

GAAGATCCAACATTGGCAATTGTATGGAGGCTTTGTTTCAAGGACCACTCCAGTATAAA 
GACTTGAAAATTGACATCAAG 

ACGAGTC CC CCTC CTGAATGTATCAATGACTTGCTCCAAGCAGTTGACTCCCAGGAGGTG 
AGAQATXACXGXGAGAAGAA 

GGGTTGGATAGTTAACATCACCAGCCAGGTTCAAACAGAAAGGAACATCAACAGGGCAA 
TGACAATTCTACAAGCGGTOA 

CAACCTT C GC CGCAGTGGCTGGAGTTGTCTATGTCATGTATAAACTGTTTGCTGGACAC C 
AGGGAGCATACACTGGTTTA 

CCAAACAAAAAACCCAACGTGCCGACCATTCGGACAGCAAAGGTACAAGGACCAGGGTTC 
GATTACGCAGXGGCXAXGGC 

TAAAAGAAACATTGTTACAGCAACTACTAGCAAGGGAGAGTTCACTATGTTAOGAGTCC 
ACOAgAACgTGGCTATTTTAC 

CAACCCACGCTTCACGTGGTGAAAGCATTGTGATCGATGGCAAAGAAGTGGAGATCTTG 
GATGCCAAAGCGCTCGAAOAT 

CAAGCAGGAACCAATCTTGAAATCACTATAATCACTCTAAAGAGAAATGAAAAGTTCAG 
AGACATTAOACCACATATACC 

TACTCAAATCACTGAGACAAATOATGGAGTCTTGATCGTGAACACTAGCAAGTACCCCA 
ATATGTATGTTCCTGTCGGTG 

CTGTGACTGAACAGOGATATCTAAATCTCOGXGGGCGCCAAACTGCXCGTACTCTAATG 
XACAACXXXCCAACGAOAGCA 

GGACAGXGXGGXGGAGXCATCACAXGXACXGGGAAAGXCAXCGGGAXGCAXGXTGGXGG 
GAACGGXXCACACGGGTTXGC 

AGCGGCCCXGAAGCGAXCAXACXXCACXCAGAGXCAAGGTGAAAXCCAGXGGATGAGAC 
CXXCGAAGGAAGXGGGATAXC 

CAAXCATAAAXGCCCCGTCCAAAACCAAGCXTGAACCCAGXGCXXXCCACXAXGXGXXXG 
AAGGGGXGAAGGAACGAGCA 

GXCCXCACXAAAAACGAXCCCAGGCTXAAGACAGACXXXGAGGAGGCAAXXXXCXCCAA 
GXACGTGGGXAACAAAAXXAC 

TGAAGXGGAXGAGXACATGAAAGAGGCAGXAGACCACXATGCXGGCCAGCXCAXGTCAC 
TAGAGAXCAACAGAGAACAAA 

TGTGCXXGGAGGAXGCCAXGXAXGGCACXOAXGGXCXAGAAGCACXXGAXXXGXCCACC 
AGTGCTGGCXACCCXXAXGXA 
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FIG. 6D 



aCAATGOGAAAGAAGAAGAGAGACATCTTGAACAAACAAACCAGAGACACTAAGGAAA 
TGCAAAAACTGCTCGACACATA 

TGGAATCAACCTCiCCACTGGTGACTTATGTAAAGGAXGAACTTAGATCCAAAACAAAGG 
TTGAGGAGGGGAAAXCCAGAT 

TAATTGAAGCTTCTAGTTTGAATGACTCAGTGGCAATGAGAATGGCTTTTGGGAACCTA 
TATGCTGCTTTTCACAAAAAC 

CCAGGAGTGATAACAGGTTCAGCAGTGGGGTGCGATCCAGATTTGTTTTGGAGCAAAAT 

TCCGGTATTGATGGAAGAGAA 

GCTGTTTGCTTTTGACTACACAGGGTATGATGCATCTCTCAGCCCTGCTTGGTTCGAGGC 
ACTAAAGATGGTGCTTGAGA 

AAATCGGATTCGGAGACAGAGTTGACTACATCGACTACCTAAACCACTCACACCACCTGT 
ACAAGAATAAAACATACTGT 

GTCAAGGGCGGTATGCCATCTGGCTGCTCAGGCACTTCAATTTTTAACTCAATGATTAA 
CAACTTGATTATCAGGACACT 

CTTACTGAAAACCTACAAGGGCATAGATTTAGACCACCTAAAAATGATTGCCTATGGTG 
ATGATGTAATTGCTTCCTACC 

CCCATGAAGTTGACGCTAGTCTCCTAGCCCAATCAGGAAAAGACTATGGACTAACTATG 
ACTCCAGCTOACAAATCAGCT 

ACATTTGAAACAGTCACATGGGAGAATGTAACATTCTTGAAGAGATTCTTCAGGGCAGA 
CGAGAAATACCCATTTCTTAT 

TCATCCAGTAATGCCAATGAAGGAAATTCATGAATCAATTAGATGGACTAAAGATCCTA 
GGAACACTCAGGATCACGTTC 

GCTCTCTGTGCCTTTTAGCTTGGCACAATGGCGAAGAAGAATATAACAAATTCCTAGCT 
AAAATCAGGAGTGTGCCAATT 

GGAAGAGCTTTATTGCTCCCAGAGTACTCAACATTGTACCGCCGTTGGCTTGACTCATT 
TTAGTAACCCTACCTCAGTCG 



AATTGGATTGGGTCATACTGTTGTAGGGGTAAATTTTTCTTTAATTCGGAGGAAAAAA 
AAAAAAAAAAAAAAAAAAAAAA 




CCTCGTGATACGCCTATTTTTATAGGTTAATGTCATGATAATAATGGTTTCTTAGACGT 
GAGGTGGCACTTTTCGGGGAA 

ATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTC 
ATGAGACAATAACCCTGATAA 

ATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCT 
TATTCCCTTTTTTGCGGCATT 

TTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATC 
AGTTGGGTGCACGAGTGGGTT 

ACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGT 
TTTCCAATGATGAGCACTTTT 

AAAGTTCTGCTATGTGGCGCGGTATTATCCCGTGTTGACGCCGGGCAAGAGCAACTCGGT 
CGCCGCATACACTATTCTCA 

GAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTAGGGATGGCATGACAG 
TAAOAGAATTATGCAGTGCTG 

CCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGA 
AGGAGCTAACCGCTTTTTTG 

CACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCC 
ATACGAAACOACGAGCGTGA 

CACCACGATGCCTGCAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACT 
TACTCTAGCTTCCCGOCAAC 

AATTAATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTXCTGCGCTCGGCCCTT 
CCGGCTGGCTGGTTTATTGCT 

GATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGCCAGAT 
GGTAAGCCCTCCCGTATCGT 

AGTTATCTACACGACGGGGAGTCAOGCAACTATGGATGAACGAAATAGACAGATCGCTG 
AGATAGGTGCCTCACTOATTA 

AGCATTGGTAACTGTCAGACCAAGTTTACTCATATATACTTTAGATTGATTTAAAACT 
TCATTTTTAATTTAAAAGGATC 
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FIG. 6E 

TAGGTGAAGATCCTTTTTQATAATCTCATQACCAAAATCCCTTAACGTGAGTTTTCGTT 
CCACTGAGCGTCAGACCCCGT 

AGAAAAGATCAAAQGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGC 
AAACAAAAAAACCACCGCTAC 

CAGCTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTC 
AGCAGAGCGCAGATACCAAA 

TACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCC 
TACATACCTC6CTCT0CTAA 

TCCTOTTACCAOTOGCTOCTOCCAGTGGCQATAAGTCOTOTCTTACCGOGTTGGACTCAA 
GACGATAGTTACC GGATAAG 

aCGCAGCGGTCGGGCTGAACGOGGGGTTCGTGCACACAOCCCAGCTTGGAGCGAACGACC 
TACACCGAACTGAGATACCT 

ACAGCGTGAGCATTGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCC 
GGTAAGCGGCAGGGTCGGAA 

CAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCG 
GGTTTCGCCACCTCTGACTT 

GAGCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAAC 
GCGGCCTTTTTACGGTTCCT 

GGCCTTTTGCTGGCCTTTTGCTCACATGTTCTTTCCTGCGTTATCCCCTGATTCTGTGGA 
TAACCGTATTACCGCCTTTG 

AGTGAGCTGATACCGCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGCGAGG 
AAGCGGAAGAGCGCCTGATG 

CGGTATTTrCTCCTTACGCATCTGTGCGGTATTTCACACCGCATATGGTGCACTCTCAGT 
ACAATCTGCTCTGATGCCGC 

ATAGTTAAGCCAGTATACACTCCGCTATCGCTACGTGACTGGGTCATGGCTGCGCCCCGA 
CACCCGCCAACACCCGCTGA 

CGCGCCCTGACGGGCTTGTCTGCTCCCGGCATCCGCTTACAGACAAGCTGTGACCGTCTCC 
GGQAGCTGCATGTGTCAGA 

GGTTTTCACCGTCATCACCGAAACGCGCGAGGCAGCTGCGGTAAAGCTCATCAGCGTGGT 
CGTGAAGCGATTCACAGATG 

TCTGCCTGTTCATCCGCGTCCAGCTCGTTGAGTTTCTCCAGAAGCGTTAATGTCTGGCTT 
CTGATAAAjGCGGGCCATGTT 

AAGGGCGGTTTTTTCCTGTTTGGTCACTTGATGCCTCCGTGTAAGGGGGAATTTCTGTT 
CATGOGGGTAATGATACCGAT 

GAAACGAGAGAGGATGCTCACGATACGGGTTACTGATGATGAACATGCCCGGTTACTGG 
AACGTTGTOAOGOTAAACAAC 

TGGCGGTATGGATGCOGCGGGACCAGAGAAAAATCACTCAGGGTCAATGCCAGCGCTTCG 
TTAATACAGATGTAGGTGTT 

CCACAGGGTAGCCAGCAGCATCCTGCGATGCAGATCCGGAACATAATGGTGCAGGGCGCT 

Gjvcnrcc gcgtt^tc? cj^GsAc^t 

ttacgaaacacggaaaccgaagaccattcatgttgttgctcaggtcgcagacgttttgc 
agcagcagtcgcttcacgttc 

gctcgcgtatcggtoattcattctoctaaccaotaaggcaaccccgccaocctagccggg 

TCCGCTCTCCCTTATGCGAC 

TCCTGCATTAGGAAGCAGCCCAGTAGTAGGTTGAGGCCGTTGAGCACCGCCGCCGCAAGG 
AATGGTGCATGCAAGGAOAT 

GGCGCCCAACAGTCCCCCGGCCACGOGGCCTGCCACCATACCCACOCCGAAACAAGCGCTC 
ATGAGCCCGAAGTGGCOAG 

CCCGATCTTCCCCATCGGTGATGTCGGCGATATAGGCGCCAGCAACCGCACCTGTGGCGCC 
GGTGATGCCGGCCACGATG 

CGTCCGGCGTAGAGGATCCCGCOAAATTAATACaACTCACTATAOG 
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